Overview

Dataset statistics

Number of variables: 23
Number of observations: 1142
Missing cells: 10
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 472.6 KiB
Average record size in memory: 423.8 B

Variable types

NUM: 16
BOOL: 4
CAT: 2
URL: 1

Reproduction

Analysis started: 2020-04-20 17:30:53.734894
Analysis finished: 2020-04-20 17:31:57.721531
Version: pandas-profiling v2.6.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

name has a high cardinality: 361 distinct values (High cardinality)
age has 283 (24.8%) zeros (Zeros)
station_1 has 15 (1.3%) zeros (Zeros)
address has 16 (1.4%) zeros (Zeros)

Variables

df_index
Real number (ℝ≥0)

UNIQUE
Distinct count: 1142
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1132.140105078809
Minimum: 36
Maximum: 1902
Zeros: 0
Zeros (%): 0.0%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 36
5-th percentile: 165.05
Q1: 735.25
median: 1214.5
Q3: 1578.75
95-th percentile: 1839.95
Maximum: 1902
Range: 1866
Interquartile range (IQR): 843.5

Descriptive statistics

Standard deviation: 528.4209354
Coefficient of variation (CV): 0.4667451785
Kurtosis: -0.8302446649
Mean: 1132.140105
Median Absolute Deviation (MAD): 440.5715109
Skewness: -0.4939077278
Sum: 1292904
Variance: 279228.685
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 36. 60.5 146.5 254.5 309.5 ... 795.5 967.5 1393.5 1435.5 1902. ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%)
1902 | 1 | 0.1%
978 | 1 | 0.1%
971 | 1 | 0.1%
972 | 1 | 0.1%
973 | 1 | 0.1%
975 | 1 | 0.1%
976 | 1 | 0.1%
977 | 1 | 0.1%
979 | 1 | 0.1%
969 | 1 | 0.1%
Other values (1132) | 1132 | 99.1%

Value | Count | Frequency (%)
36 | 1 | 0.1%
37 | 1 | 0.1%
38 | 1 | 0.1%
39 | 1 | 0.1%
40 | 1 | 0.1%

Value | Count | Frequency (%)
1902 | 1 | 0.1%
1901 | 1 | 0.1%
1897 | 1 | 0.1%
1896 | 1 | 0.1%
1893 | 1 | 0.1%

name
Categorical

HIGH CARDINALITY
Distinct count: 361
Unique (%): 31.6%
Missing: 0
Missing (%): 0.0%
Memory size: 9.0 KiB

Value | Count | Frequency (%)
ザ・パークハビオ西大井 | 68 | 6.0%
THE PREMIUM CUBE G 大崎 | 45 | 3.9%
アルカーデ荏原中延 | 43 | 3.8%
メゾンドアーク南大井 | 26 | 2.3%
京急本線 立会川駅 13階建 新築 | 24 | 2.1%
トーシンフェニックス五反田 | 22 | 1.9%
クレヴィスタ大森 | 22 | 1.9%
ディームス品川戸越II | 16 | 1.4%
パークキューブ大井町 | 14 | 1.2%
リビオメゾン大崎 | 14 | 1.2%
Other values (351) | 848 | 74.3%

Length

Max length: 30
Mean length: 12.82136602
Min length: 3

Value | Count | Frequency (%)
Other_Letter | 196 | 67.6%
Uppercase_Letter | 43 | 14.8%
Lowercase_Letter | 25 | 8.6%
Decimal_Number | 13 | 4.5%
Dash_Punctuation | 3 | 1.0%
Other_Punctuation | 3 | 1.0%
Space_Separator | 2 | 0.7%
Close_Punctuation | 2 | 0.7%
Final_Punctuation | 1 | 0.3%
Open_Punctuation | 1 | 0.3%

Value | Count | Frequency (%)
Han | 118 | 40.7%
Katakana | 70 | 24.1%
Latin | 68 | 23.4%
Common | 26 | 9.0%
Hiragana | 8 | 2.8%

Value | Count | Frequency (%)
CJK | 118 | 45.9%
Katakana | 72 | 28.0%
ASCII | 57 | 22.2%
Hiragana | 8 | 3.1%
Punctuation | 2 | 0.8%

real_rent
Real number (ℝ≥0)

Distinct count: 136
Unique (%): 11.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 110209.06304728547
Minimum: 55000.0
Maximum: 149500.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 55000
5-th percentile: 85000
Q1: 95000
median: 111000
Q3: 121000
95-th percentile: 142000
Maximum: 149500
Range: 94500
Interquartile range (IQR): 26000

Descriptive statistics

Standard deviation: 18386.39703
Coefficient of variation (CV): 0.1668319875
Kurtosis: -0.2973305569
Mean: 110209.063
Median Absolute Deviation (MAD): 14850.12422
Skewness: -0.01320095434
Sum: 125858750
Variance: 338059595.7
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 55000. 79000. 84500. 87700. 89300. ... 119250. 119750. 120250. 124500. 149500.], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%)
120000 | 43 | 3.8%
119000 | 34 | 3.0%
118000 | 32 | 2.8%
110000 | 31 | 2.7%
90000 | 30 | 2.6%
88000 | 25 | 2.2%
112000 | 24 | 2.1%
114000 | 23 | 2.0%
95000 | 23 | 2.0%
115000 | 22 | 1.9%
Other values (126) | 855 | 74.9%

Value | Count | Frequency (%)
55000 | 2 | 0.2%
57000 | 2 | 0.2%
58000 | 1 | 0.1%
59500 | 1 | 0.1%
60000 | 1 | 0.1%

Value | Count | Frequency (%)
149500 | 1 | 0.1%
149000 | 7 | 0.6%
148000 | 13 | 1.1%
147000 | 4 | 0.4%
146000 | 6 | 0.5%

age
Real number (ℝ≥0)

ZEROS
Distinct count: 47
Unique (%): 4.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.174255691768826
Minimum: 0
Maximum: 66
Zeros: 283
Zeros (%): 24.8%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
median: 6
Q3: 14
95-th percentile: 31
Maximum: 66
Range: 66
Interquartile range (IQR): 13

Descriptive statistics

Standard deviation: 10.26700181
Coefficient of variation (CV): 1.119110057
Kurtosis: 3.564887579
Mean: 9.174255692
Median Absolute Deviation (MAD): 7.828625848
Skewness: 1.687347695
Sum: 10477
Variance: 105.4113261
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 29.5 31.5 34.5 48.5 66. ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%)
0 | 283 | 24.8%
3 | 109 | 9.5%
2 | 66 | 5.8%
14 | 63 | 5.5%
16 | 52 | 4.6%
13 | 51 | 4.5%
10 | 49 | 4.3%
7 | 46 | 4.0%
4 | 38 | 3.3%
6 | 35 | 3.1%
Other values (37) | 350 | 30.6%

Value | Count | Frequency (%)
0 | 283 | 24.8%
1 | 20 | 1.8%
2 | 66 | 5.8%
3 | 109 | 9.5%
4 | 38 | 3.3%

Value | Count | Frequency (%)
66 | 2 | 0.2%
56 | 1 | 0.1%
52 | 1 | 0.1%
49 | 1 | 0.1%
48 | 4 | 0.4%

height
Real number (ℝ≥0)

Distinct count: 18
Unique (%): 1.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.600700525394046
Minimum: 2
Maximum: 29
Zeros: 0
Zeros (%): 0.0%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 2
5-th percentile: 3
Q1: 5
median: 8
Q3: 12.75
95-th percentile: 15
Maximum: 29
Range: 27
Interquartile range (IQR): 7.75

Descriptive statistics

Standard deviation: 4.544831646
Coefficient of variation (CV): 0.5284257523
Kurtosis: 0.5615853262
Mean: 8.600700525
Median Absolute Deviation (MAD): 3.911747296
Skewness: 0.582996387
Sum: 9822
Variance: 20.65549469
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 2. 5.5 6.5 7.5 8.5 ... 10.5 11.5 14.5 18.5 29. ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%)
5 | 145 | 12.7%
7 | 121 | 10.6%
4 | 114 | 10.0%
3 | 112 | 9.8%
12 | 106 | 9.3%
13 | 99 | 8.7%
15 | 96 | 8.4%
9 | 93 | 8.1%
14 | 79 | 6.9%
11 | 52 | 4.6%
Other values (8) | 125 | 10.9%

Value | Count | Frequency (%)
2 | 48 | 4.2%
3 | 112 | 9.8%
4 | 114 | 10.0%
5 | 145 | 12.7%
6 | 12 | 1.1%

Value | Count | Frequency (%)
29 | 5 | 0.4%
24 | 1 | 0.1%
23 | 5 | 0.4%
22 | 1 | 0.1%
15 | 96 | 8.4%

level
Real number (ℝ)

Distinct count: 17
Unique (%): 1.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.5683012259194395
Minimum: -2.0
Maximum: 15.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 9.0 KiB

Quantile statistics

Minimum: -2
5-th percentile: 1
Q1: 2
median: 4
Q3: 6
95-th percentile: 12
Maximum: 15
Range: 17
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 3.251396586
Coefficient of variation (CV): 0.7117299024
Kurtosis: 0.5580302814
Mean: 4.568301226
Median Absolute Deviation (MAD): 2.590655776
Skewness: 1.063865046
Sum: 5217
Variance: 10.57157976
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-2. 0. 1.5 4.5 7.5 10.5 15. ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%)
2 | 186 | 16.3%
3 | 181 | 15.8%
1 | 169 | 14.8%
4 | 151 | 13.2%
5 | 98 | 8.6%
6 | 82 | 7.2%
7 | 70 | 6.1%
8 | 48 | 4.2%
9 | 39 | 3.4%
10 | 36 | 3.2%
Other values (7) | 82 | 7.2%

Value | Count | Frequency (%)
-2 | 1 | 0.1%
-1 | 4 | 0.4%
1 | 169 | 14.8%
2 | 186 | 16.3%
3 | 181 | 15.8%

Value | Count | Frequency (%)
15 | 3 | 0.3%
14 | 15 | 1.3%
13 | 20 | 1.8%
12 | 24 | 2.1%
11 | 15 | 1.3%

area
Real number (ℝ≥0)

Distinct count: 379
Unique (%): 33.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.0837880910683
Minimum: 10.752
Maximum: 432.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 10.752
5-th percentile: 19.235
Q1: 20.812
median: 23.282
Q3: 27.512
95-th percentile: 192
Maximum: 432
Range: 421.248
Interquartile range (IQR): 6.7

Descriptive statistics

Standard deviation: 53.88915736
Coefficient of variation (CV): 1.45317294
Kurtosis: 21.85872845
Mean: 37.08378809
Median Absolute Deviation (MAD): 24.04981283
Skewness: 4.551702568
Sum: 42349.686
Variance: 2904.04128
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 10.752 16.017 19.526 19.997 20.057 ... 50.731 207. 217. 272. 432. ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%)
23.982 | 55 | 4.8%
212 | 23 | 2.0%
20.512 | 23 | 2.0%
21.172 | 21 | 1.8%
20.612 | 21 | 1.8%
21.752 | 18 | 1.6%
26.972 | 17 | 1.5%
27.842 | 16 | 1.4%
28.712 | 12 | 1.1%
21.462 | 12 | 1.1%
Other values (369) | 924 | 80.9%

Value | Count | Frequency (%)
10.752 | 1 | 0.1%
11.92 | 1 | 0.1%
12.212 | 1 | 0.1%
12.422 | 2 | 0.2%
12.972 | 1 | 0.1%

Value | Count | Frequency (%)
432 | 2 | 0.2%
422 | 1 | 0.1%
402 | 3 | 0.3%
382 | 2 | 0.2%
342 | 1 | 0.1%

route_1
Real number (ℝ≥0)

Distinct count: 10
Unique (%): 0.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.189141856392294
Minimum: 0.0
Maximum: 10.0
Zeros: 7
Zeros (%): 0.6%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 4
median: 7
Q3: 9
95-th percentile: 10
Maximum: 10
Range: 10
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 3.048130612
Coefficient of variation (CV): 0.4924964854
Kurtosis: -1.122751866
Mean: 6.189141856
Median Absolute Deviation (MAD): 2.725003911
Skewness: -0.5207875609
Sum: 7068
Variance: 9.291100227
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 2. 3.5 5.5 6.5 7.5 10. ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%)
9 | 246 | 21.5%
8 | 210 | 18.4%
1 | 169 | 14.8%
5 | 116 | 10.2%
10 | 106 | 9.3%
7 | 99 | 8.7%
4 | 93 | 8.1%
3 | 92 | 8.1%
0 | 7 | 0.6%
6 | 4 | 0.4%

Value | Count | Frequency (%)
0 | 7 | 0.6%
1 | 169 | 14.8%
3 | 92 | 8.1%
4 | 93 | 8.1%
5 | 116 | 10.2%

Value | Count | Frequency (%)
10 | 106 | 9.3%
9 | 246 | 21.5%
8 | 210 | 18.4%
7 | 99 | 8.7%
6 | 4 | 0.4%

route_2
Real number (ℝ≥0)

Distinct count: 15
Unique (%): 1.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.542907180385289
Minimum: 0.0
Maximum: 14.0
Zeros: 3
Zeros (%): 0.3%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 3
median: 6
Q3: 10
95-th percentile: 13
Maximum: 14
Range: 14
Interquartile range (IQR): 7

Descriptive statistics

Standard deviation: 3.65652833
Coefficient of variation (CV): 0.5588537678
Kurtosis: -1.134661073
Mean: 6.54290718
Median Absolute Deviation (MAD): 3.106787183
Skewness: 0.2134264736
Sum: 7472
Variance: 13.37019943
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 4.5 ... 7.5 9.5 11.5 12.5 14. ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%)
2 | 252 | 22.1%
5 | 177 | 15.5%
6 | 163 | 14.3%
10 | 153 | 13.4%
11 | 153 | 13.4%
7 | 104 | 9.1%
13 | 47 | 4.1%
1 | 27 | 2.4%
3 | 22 | 1.9%
14 | 21 | 1.8%
Other values (5) | 23 | 2.0%

Value | Count | Frequency (%)
0 | 3 | 0.3%
1 | 27 | 2.4%
2 | 252 | 22.1%
3 | 22 | 1.9%
4 | 8 | 0.7%

Value | Count | Frequency (%)
14 | 21 | 1.8%
13 | 47 | 4.1%
12 | 9 | 0.8%
11 | 153 | 13.4%
10 | 153 | 13.4%

route_3
Real number (ℝ≥0)

Distinct count: 14
Unique (%): 1.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.377408056042031
Minimum: 0.0
Maximum: 14.0
Zeros: 7
Zeros (%): 0.6%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 5
median: 7
Q3: 10
95-th percentile: 13
Maximum: 14
Range: 14
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 3.970126961
Coefficient of variation (CV): 0.5381465863
Kurtosis: -1.215149067
Mean: 7.377408056
Median Absolute Deviation (MAD): 3.502021218
Skewness: -0.009185421141
Sum: 8425
Variance: 15.76190809
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 1.5 2.5 4.5 6.5 ... 9.5 10.5 11.5 12.5 14. ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%)
10 | 228 | 20.0%
2 | 194 | 17.0%
6 | 162 | 14.2%
5 | 141 | 12.3%
13 | 117 | 10.2%
11 | 96 | 8.4%
7 | 73 | 6.4%
14 | 53 | 4.6%
1 | 38 | 3.3%
4 | 11 | 1.0%
Other values (4) | 29 | 2.5%

Value | Count | Frequency (%)
0 | 7 | 0.6%
1 | 38 | 3.3%
2 | 194 | 17.0%
3 | 9 | 0.8%
4 | 11 | 1.0%

Value | Count | Frequency (%)
14 | 53 | 4.6%
13 | 117 | 10.2%
12 | 8 | 0.7%
11 | 96 | 8.4%
10 | 228 | 20.0%

station_1
Real number (ℝ≥0)

ZEROS
Distinct count: 31
Unique (%): 2.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14.134851138353765
Minimum: 0.0
Maximum: 31.0
Zeros: 15
Zeros (%): 1.3%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 7
median: 14
Q3: 22
95-th percentile: 26
Maximum: 31
Range: 31
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 8.258914549
Coefficient of variation (CV): 0.584294413
Kurtosis: -1.22029565
Mean: 14.13485114
Median Absolute Deviation (MAD): 7.23996982
Skewness: 0.01052599189
Sum: 16142
Variance: 68.20966952
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 2.5 3.5 5.5 6.5 ... 26.5 27.5 28.5 30.5 31. ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%)
7 | 127 | 11.1%
25 | 109 | 9.5%
3 | 100 | 8.8%
10 | 90 | 7.9%
22 | 84 | 7.4%
12 | 83 | 7.3%
16 | 74 | 6.5%
23 | 64 | 5.6%
19 | 50 | 4.4%
1 | 40 | 3.5%
Other values (21) | 321 | 28.1%

Value | Count | Frequency (%)
0 | 15 | 1.3%
1 | 40 | 3.5%
2 | 33 | 2.9%
3 | 100 | 8.8%
4 | 10 | 0.9%

Value | Count | Frequency (%)
31 | 9 | 0.8%
30 | 5 | 0.4%
29 | 2 | 0.2%
28 | 21 | 1.8%
27 | 2 | 0.2%

station_2
Real number (ℝ≥0)

Distinct count: 36
Unique (%): 3.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15.088441330998249
Minimum: 0.0
Maximum: 36.0
Zeros: 3
Zeros (%): 0.3%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 8
median: 14
Q3: 23
95-th percentile: 33
Maximum: 36
Range: 36
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 9.901710979
Coefficient of variation (CV): 0.6562447877
Kurtosis: -0.9277577582
Mean: 15.08844133
Median Absolute Deviation (MAD): 8.294001368
Skewness: 0.308826008
Sum: 17231
Variance: 98.0438803
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 18.5 20.5 31.5 32.5 36. ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%)
9 | 133 | 11.6%
14 | 122 | 10.7%
1 | 101 | 8.8%
3 | 70 | 6.1%
12 | 66 | 5.8%
19 | 55 | 4.8%
20 | 42 | 3.7%
15 | 41 | 3.6%
27 | 39 | 3.4%
4 | 36 | 3.2%
Other values (26) | 437 | 38.3%

Value | Count | Frequency (%)
0 | 3 | 0.3%
1 | 101 | 8.8%
2 | 26 | 2.3%
3 | 70 | 6.1%
4 | 36 | 3.2%

Value | Count | Frequency (%)
36 | 8 | 0.7%
35 | 24 | 2.1%
34 | 7 | 0.6%
33 | 30 | 2.6%
32 | 4 | 0.4%

station_3
Real number (ℝ≥0)

Distinct count: 37
Unique (%): 3.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17.816987740805605
Minimum: 0.0
Maximum: 36.0
Zeros: 7
Zeros (%): 0.6%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 9
median: 19
Q3: 27
95-th percentile: 34
Maximum: 36
Range: 36
Interquartile range (IQR): 18

Descriptive statistics

Standard deviation: 10.18276243
Coefficient of variation (CV): 0.5715198651
Kurtosis: -1.117949466
Mean: 17.81698774
Median Absolute Deviation (MAD): 8.576108526
Skewness: -0.0793814559
Sum: 20347
Variance: 103.6886507
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 2.5 4.5 8.5 9.5 ... 28.5 29.5 30.5 32.5 36. ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%)
20 | 90 | 7.9%
3 | 88 | 7.7%
27 | 85 | 7.4%
18 | 73 | 6.4%
9 | 63 | 5.5%
4 | 62 | 5.4%
19 | 58 | 5.1%
15 | 53 | 4.6%
30 | 53 | 4.6%
28 | 46 | 4.0%
Other values (27) | 471 | 41.2%

Value | Count | Frequency (%)
0 | 7 | 0.6%
1 | 24 | 2.1%
2 | 34 | 3.0%
3 | 88 | 7.7%
4 | 62 | 5.4%

Value | Count | Frequency (%)
36 | 17 | 1.5%
35 | 23 | 2.0%
34 | 28 | 2.5%
33 | 27 | 2.4%
32 | 13 | 1.1%

distance_1
Real number (ℝ≥0)

Distinct count: 17
Unique (%): 1.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.143607705779335
Minimum: 1
Maximum: 20
Zeros: 0
Zeros (%): 0.0%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 4
median: 6
Q3: 8
95-th percentile: 11
Maximum: 20
Range: 19
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 2.983220433
Coefficient of variation (CV): 0.485581205
Kurtosis: 0.8953924816
Mean: 6.143607706
Median Absolute Deviation (MAD): 2.384356569
Skewness: 0.623343444
Sum: 7016
Variance: 8.899604151
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 3.5 5.5 6.5 7.5 11.5 16.5 20. ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%)
7 | 199 | 17.4%
5 | 172 | 15.1%
4 | 133 | 11.6%
3 | 103 | 9.0%
6 | 96 | 8.4%
9 | 86 | 7.5%
8 | 79 | 6.9%
2 | 70 | 6.1%
10 | 63 | 5.5%
11 | 59 | 5.2%
Other values (7) | 82 | 7.2%

Value | Count | Frequency (%)
1 | 49 | 4.3%
2 | 70 | 6.1%
3 | 103 | 9.0%
4 | 133 | 11.6%
5 | 172 | 15.1%

Value | Count | Frequency (%)
20 | 3 | 0.3%
18 | 1 | 0.1%
15 | 8 | 0.7%
14 | 4 | 0.4%
13 | 14 | 1.2%

distance_2
Real number (ℝ≥0)

Distinct count: 20
Unique (%): 1.8%
Missing: 3
Missing (%): 0.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.600526777875329
Minimum: 1.0
Maximum: 26.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 4
Q1: 7
median: 10
Q3: 12
95-th percentile: 16
Maximum: 26
Range: 25
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 3.595696471
Coefficient of variation (CV): 0.3745311642
Kurtosis: -0.303050751
Mean: 9.600526778
Median Absolute Deviation (MAD): 2.929402977
Skewness: 0.2615388278
Sum: 10935
Variance: 12.92903311
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
10 | 158 | 13.8%
6 | 137 | 12.0%
8 | 132 | 11.6%
12 | 104 | 9.1%
11 | 89 | 7.8%
9 | 86 | 7.5%
16 | 83 | 7.3%
7 | 77 | 6.7%
5 | 55 | 4.8%
14 | 55 | 4.8%
Other values (10) | 163 | 14.3%

Value | Count | Frequency (%)
1 | 5 | 0.4%
2 | 9 | 0.8%
3 | 17 | 1.5%
4 | 42 | 3.7%
5 | 55 | 4.8%

Value | Count | Frequency (%)
26 | 1 | 0.1%
19 | 1 | 0.1%
18 | 7 | 0.6%
17 | 3 | 0.3%
16 | 83 | 7.3%

distance_3
Real number (ℝ≥0)

Distinct count: 21
Unique (%): 1.9%
Missing: 7
Missing (%): 0.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 12.311894273127754
Minimum: 1.0
Maximum: 34.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6
Q1: 10
median: 13
Q3: 14
95-th percentile: 18
Maximum: 34
Range: 33
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 3.68939438
Coefficient of variation (CV): 0.2996609862
Kurtosis: 0.4893252366
Mean: 12.31189427
Median Absolute Deviation (MAD): 3.006345941
Skewness: 0.003285057741
Sum: 13974
Variance: 13.61163089
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
14 | 226 | 19.8%
8 | 117 | 10.2%
12 | 107 | 9.4%
15 | 95 | 8.3%
10 | 95 | 8.3%
18 | 94 | 8.2%
11 | 77 | 6.7%
13 | 74 | 6.5%
9 | 67 | 5.9%
17 | 45 | 3.9%
Other values (11) | 138 | 12.1%

Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 3 | 0.3%
4 | 18 | 1.6%
5 | 21 | 1.8%
6 | 20 | 1.8%

Value | Count | Frequency (%)
34 | 1 | 0.1%
21 | 5 | 0.4%
20 | 9 | 0.8%
19 | 8 | 0.7%
18 | 94 | 8.2%

rooms
Categorical

Distinct count: 3
Unique (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 9.0 KiB

Value | Count | Frequency (%)
1 | 1100 | 96.3%
2 | 38 | 3.3%
3 | 4 | 0.4%

Length

Max length: 1
Mean length: 1
Min length: 1

Value | Count | Frequency (%)
Decimal_Number | 3 | 100.0%

Value | Count | Frequency (%)
Common | 3 | 100.0%

Value | Count | Frequency (%)
ASCII | 3 | 100.0%

DK
Boolean

Distinct count: 2
Unique (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 9.0 KiB

Value | Count | Frequency (%)
0 | 988 | 86.5%
1 | 154 | 13.5%

K
Boolean

Distinct count: 2
Unique (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 9.0 KiB

Value | Count | Frequency (%)
1 | 788 | 69.0%
0 | 354 | 31.0%

L
Boolean

Distinct count: 2
Unique (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 9.0 KiB

Value | Count | Frequency (%)
0 | 1056 | 92.5%
1 | 86 | 7.5%

S
Boolean

Distinct count: 2
Unique (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 9.0 KiB

Value | Count | Frequency (%)
0 | 1138 | 99.6%
1 | 4 | 0.4%

address
Real number (ℝ≥0)

ZEROS
Distinct count: 100
Unique (%): 8.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 52.93169877408056
Minimum: 0.0
Maximum: 103.0
Zeros: 16
Zeros (%): 1.4%
Memory size: 9.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 8
Q1: 25
median: 55
Q3: 83.75
95-th percentile: 94
Maximum: 103
Range: 103
Interquartile range (IQR): 58.75

Descriptive statistics

Standard deviation: 29.34835162
Coefficient of variation (CV): 0.5544570135
Kurtosis: -1.346490866
Mean: 52.93169877
Median Absolute Deviation (MAD): 26.00358851
Skewness: 0.01359352477
Sum: 60448
Variance: 861.3257428
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 14.5 16.5 18.5 ... 93.5 94.5 98.5 102.5 103. ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%)
25 | 78 | 6.8%
94 | 73 | 6.4%
28 | 58 | 5.1%
92 | 50 | 4.4%
60 | 48 | 4.2%
84 | 40 | 3.5%
24 | 36 | 3.2%
26 | 34 | 3.0%
19 | 27 | 2.4%
68 | 25 | 2.2%
Other values (90) | 673 | 58.9%

Value | Count | Frequency (%)
0 | 16 | 1.4%
1 | 9 | 0.8%
2 | 5 | 0.4%
4 | 3 | 0.3%
5 | 4 | 0.4%

Value | Count | Frequency (%)
103 | 6 | 0.5%
102 | 1 | 0.1%
101 | 2 | 0.2%
100 | 1 | 0.1%
99 | 2 | 0.2%

url
URL

UNIQUE
Distinct count: 1142
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 9.0 KiB

Value | Count | Frequency (%)
https://suumo.jp/chintai/jnc_000056246960/?bc=100189560728 | 1 | 0.1%
https://suumo.jp/chintai/jnc_000056887146/?bc=100191990560 | 1 | 0.1%
https://suumo.jp/chintai/jnc_000055281873/?bc=100194464659 | 1 | 0.1%
https://suumo.jp/chintai/jnc_000056847166/?bc=100193788799 | 1 | 0.1%
https://suumo.jp/chintai/jnc_000056559042/?bc=100182037907 | 1 | 0.1%
https://suumo.jp/chintai/jnc_000057329379/?bc=100194251518 | 1 | 0.1%
https://suumo.jp/chintai/jnc_000017362732/?bc=100192189034 | 1 | 0.1%
https://suumo.jp/chintai/jnc_000057139039/?bc=100193379234 | 1 | 0.1%
https://suumo.jp/chintai/jnc_000055062832/?bc=100187086594 | 1 | 0.1%
https://suumo.jp/chintai/jnc_000055972454/?bc=100193477778 | 1 | 0.1%
Other values (1132) | 1132 | 99.1%

Value | Count | Frequency (%)
https | 1142 | 100.0%

Value | Count | Frequency (%)
suumo.jp | 1142 | 100.0%

Value | Count | Frequency (%)
/chintai/jnc_000055887376/ | 1 | 0.1%
/chintai/jnc_000056195398/ | 1 | 0.1%
/chintai/jnc_000056598962/ | 1 | 0.1%
/chintai/jnc_000056840821/ | 1 | 0.1%
/chintai/jnc_000057242313/ | 1 | 0.1%
/chintai/jnc_000057288251/ | 1 | 0.1%
/chintai/jnc_000057242039/ | 1 | 0.1%
/chintai/jnc_000055281885/ | 1 | 0.1%
/chintai/jnc_000056745549/ | 1 | 0.1%
/chintai/jnc_000056168465/ | 1 | 0.1%
Other values (1132) | 1132 | 99.1%

Value | Count | Frequency (%)
bc=100194316146 | 1 | 0.1%
bc=100194113370 | 1 | 0.1%
bc=100189562097 | 1 | 0.1%
bc=100185047851 | 1 | 0.1%
bc=100194369079 | 1 | 0.1%
bc=100089305010 | 1 | 0.1%
bc=100194505194 | 1 | 0.1%
bc=100191259735 | 1 | 0.1%
bc=100188240953 | 1 | 0.1%
bc=100191398509 | 1 | 0.1%
Other values (1132) | 1132 | 99.1%

Value | Count | Frequency (%)
(empty) | 1142 | 100.0%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a linear relationship the steepness of the line (its angle to the x-axis) does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
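
The calculation above can be sketched in plain Python. This is an illustrative sketch only, not the code the report generator itself runs; the function name `pearson_r` and the sample data are assumptions made for the example.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of centered products; the 1/n factors of the covariance and of the
    # standard deviations cancel in the ratio, so they are omitted.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("r is undefined when one variable is constant")
    return cov / sqrt(var_x * var_y)


xs = [1.0, 2.0, 3.0, 4.0, 5.0]
r_gentle = pearson_r(xs, [2 * v + 1 for v in xs])   # increasing line
r_steep = pearson_r(xs, [10 * v - 7 for v in xs])   # steeper, shifted line
r_down = pearson_r(xs, [-v for v in xs])            # decreasing line
```

Both increasing lines give the same r despite their different slopes and offsets, which illustrates the invariance under changes of location and scale; the decreasing line gives the opposite sign.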

Spearman's ρ

Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables, and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
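
A minimal sketch of that rank-based calculation in plain Python follows. The helper names `average_ranks` and `spearman_rho` are hypothetical, and giving tied values the mean of the ranks they span is an assumption of this sketch (a common convention), not something the report states.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of equal values that starts at sorted position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Pearson correlation computed on the rank variables of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / sqrt(var_x * var_y)


xs = [1, 2, 3, 4, 5]
rho_cubic = spearman_rho(xs, [v ** 3 for v in xs])   # monotonic but nonlinear
rho_reversed = spearman_rho(xs, [-v for v in xs])    # strictly decreasing
tied_ranks = average_ranks([10, 20, 20, 30])
```

Because only the ordering matters, the cubic relationship is treated the same as a straight line would be.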

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
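
The pair-counting definition above corresponds to the variant usually called tau-a, and it can be sketched directly in plain Python. The function name `kendall_tau_a` and the sample data are illustrative assumptions; library implementations often use the tau-b variant instead, which adjusts for ties, so their results may differ when ties are present.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1  # both variables move in the same direction
        elif sign < 0:
            discordant += 1  # the variables move in opposite directions
        # Pairs tied in x or y count as neither concordant nor discordant.
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# One swapped neighbour: 5 concordant pairs and 1 discordant pair out of 6.
tau_swapped = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
tau_reversed = kendall_tau_a([1, 2, 3, 4], [4, 3, 2, 1])
```

This O(n²) loop is fine for illustration; for large samples a library routine is the more practical choice.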

Missing values

Sample

First rows

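(row) | df_index | name | real_rent | age | height | level | area | route_1 | route_2 | route_3 | station_1 | station_2 | station_3 | distance_1 | distance_2 | distance_3 | rooms | DK | K | L | S | address | url
0 | 36 | メゾンドアーク南大井 | 109000.0 | 0 | 13 | 3.0 | 26.972 | 1.0 | 2.0 | 11.0 | 22.0 | 14.0 | 15.0 | 7 | 10.0 | 14.0 | 1 | 0 | 1 | 0 | 0 | 25.0 | https://suumo.jp/chintai/jnc_000055218472/?bc=100184907792
1 | 37 | メゾンドアーク南大井 | 109000.0 | 0 | 13 | 3.0 | 26.972 | 1.0 | 2.0 | 11.0 | 22.0 | 14.0 | 15.0 | 7 | 10.0 | 14.0 | 1 | 0 | 1 | 0 | 0 | 25.0 | https://suumo.jp/chintai/jnc_000055243806/?bc=100186585746
2 | 38 | メゾンドアーク南大井 | 109000.0 | 0 | 13 | 4.0 | 26.972 | 1.0 | 2.0 | 11.0 | 22.0 | 14.0 | 15.0 | 7 | 10.0 | 14.0 | 1 | 0 | 1 | 0 | 0 | 25.0 | https://suumo.jp/chintai/jnc_000055281890/?bc=100184907838
3 | 39 | メゾンドアーク南大井 | 109000.0 | 0 | 13 | 4.0 | 26.972 | 1.0 | 2.0 | 11.0 | 22.0 | 14.0 | 15.0 | 7 | 10.0 | 14.0 | 1 | 0 | 1 | 0 | 0 | 25.0 | https://suumo.jp/chintai/jnc_000055281887/?bc=100184810894
4 | 40 | メゾンドアーク南大井 | 110000.0 | 0 | 13 | 3.0 | 27.842 | 1.0 | 2.0 | 11.0 | 22.0 | 14.0 | 15.0 | 7 | 10.0 | 14.0 | 1 | 0 | 1 | 0 | 0 | 25.0 | https://suumo.jp/chintai/jnc_000055281884/?bc=100184907800
5 | 41 | メゾンドアーク南大井 | 110000.0 | 0 | 13 | 4.0 | 27.842 | 1.0 | 2.0 | 11.0 | 22.0 | 14.0 | 15.0 | 7 | 10.0 | 14.0 | 1 | 0 | 1 | 0 | 0 | 25.0 | https://suumo.jp/chintai/jnc_000055281889/?bc=100184810897
6 | 42 | メゾンドアーク南大井 | 110000.0 | 0 | 13 | 4.0 | 27.842 | 1.0 | 2.0 | 11.0 | 22.0 | 14.0 | 15.0 | 7 | 10.0 | 14.0 | 1 | 0 | 1 | 0 | 0 | 25.0 | https://suumo.jp/chintai/jnc_000055281888/?bc=100186585748
7 | 43 | メゾンドアーク南大井 | 110000.0 | 0 | 13 | 3.0 | 27.842 | 1.0 | 2.0 | 11.0 | 22.0 | 14.0 | 15.0 | 7 | 10.0 | 14.0 | 1 | 0 | 1 | 0 | 0 | 25.0 | https://suumo.jp/chintai/jnc_000055281883/?bc=100184810842
8 | 44 | メゾンドアーク南大井 | 111000.0 | 0 | 13 | 5.0 | 26.972 | 1.0 | 2.0 | 11.0 | 22.0 | 14.0 | 15.0 | 7 | 10.0 | 14.0 | 1 | 0 | 1 | 0 | 0 | 25.0 | https://suumo.jp/chintai/jnc_000055281896/?bc=100184810876
9 | 45 | メゾンドアーク南大井 | 111000.0 | 0 | 13 | 6.0 | 26.972 | 1.0 | 2.0 | 11.0 | 22.0 | 14.0 | 15.0 | 7 | 10.0 | 14.0 | 1 | 0 | 1 | 0 | 0 | 25.0 | https://suumo.jp/chintai/jnc_000055299355/?bc=100187923295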

Last rows

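(row) | df_index | name | real_rent | age | height | level | area | route_1 | route_2 | route_3 | station_1 | station_2 | station_3 | distance_1 | distance_2 | distance_3 | rooms | DK | K | L | S | address | url
1132 | 1888 | セゾン ド 西大井 | 86000.0 | 4 | 3 | 1.0 | 23.182 | 10.0 | 10.0 | 5.0 | 25.0 | 34.0 | 3.0 | 5 | 11.0 | 14.0 | 1 | 0 | 0 | 0 | 0 | 95.0 | https://suumo.jp/chintai/jnc_000025664118/?bc=100192390948
1133 | 1889 | JR横須賀線 西大井駅 3階建 築4年 | 86000.0 | 4 | 3 | 1.0 | 23.182 | 10.0 | 10.0 | 10.0 | 25.0 | 34.0 | 3.0 | 5 | 11.0 | 14.0 | 1 | 0 | 0 | 0 | 0 | 95.0 | https://suumo.jp/chintai/jnc_000025664126/?bc=100194200556
1134 | 1890 | エルフォルテ品川サウスシティ | 109500.0 | 4 | 9 | 4.0 | 25.852 | 8.0 | 2.0 | 2.0 | 7.0 | 21.0 | 33.0 | 10 | 6.0 | 10.0 | 1 | 0 | 1 | 0 | 0 | 19.0 | https://suumo.jp/chintai/jnc_000056727786/?bc=100194389571
1135 | 1891 | エルフォルテ品川サウスシティ | 110000.0 | 4 | 9 | 4.0 | 25.952 | 8.0 | 2.0 | 2.0 | 7.0 | 21.0 | 33.0 | 10 | 6.0 | 10.0 | 1 | 0 | 1 | 0 | 0 | 19.0 | https://suumo.jp/chintai/jnc_000056717242/?bc=100194038209
1136 | 1892 | エルフォルテ品川サウスシティ | 110500.0 | 4 | 9 | 4.0 | 25.952 | 8.0 | 2.0 | 2.0 | 7.0 | 21.0 | 33.0 | 10 | 6.0 | 10.0 | 1 | 0 | 1 | 0 | 0 | 19.0 | https://suumo.jp/chintai/jnc_000055801386/?bc=100187599195
1137 | 1893 | エルフォルテ品川サウスシティ | 110500.0 | 4 | 9 | 5.0 | 25.952 | 8.0 | 2.0 | 2.0 | 7.0 | 21.0 | 33.0 | 10 | 6.0 | 10.0 | 1 | 0 | 1 | 0 | 0 | 19.0 | https://suumo.jp/chintai/jnc_000056559127/?bc=100191572914
1138 | 1896 | パークスクエア戸越銀座 | 93000.0 | 18 | 3 | 2.0 | 25.022 | 4.0 | 10.0 | 6.0 | 15.0 | 20.0 | 28.0 | 4 | 7.0 | 13.0 | 1 | 0 | 1 | 0 | 0 | 47.0 | https://suumo.jp/chintai/jnc_000057074536/?bc=100193531436
1139 | 1897 | ブリリアタワー品川シーサイド | 120000.0 | 14 | 22 | 3.0 | 30.192 | 8.0 | 1.0 | 2.0 | 7.0 | 7.0 | 33.0 | 20 | 2.0 | 11.0 | 1 | 0 | 1 | 0 | 0 | 69.0 | https://suumo.jp/chintai/jnc_000057224210/?bc=100194321310
1140 | 1901 | 京急本線 立会川駅 15階建 築4年 | 88000.0 | 4 | 15 | 4.0 | 22.742 | 1.0 | 11.0 | 5.0 | 22.0 | 15.0 | 9.0 | 8 | 13.0 | 18.0 | 1 | 0 | 0 | 0 | 0 | 25.0 | https://suumo.jp/chintai/jnc_000044124294/?bc=100194264044
1141 | 1902 | アイルカナーレ品川南 | 88000.0 | 4 | 15 | 4.0 | 22.742 | 1.0 | 11.0 | 2.0 | 22.0 | 15.0 | 14.0 | 8 | 13.0 | 10.0 | 1 | 0 | 0 | 0 | 0 | 25.0 | https://suumo.jp/chintai/jnc_000044136736/?bc=100192705326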